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ON A CRITERION OF SUBSTRATUM HOMOGE- 
NEITY (OR HETEROGENEITY) IN FIELD 
EXPERIMENTS 

Dr. J. ARTHUR HARRIS 

Carnegie Institution op Washington 

I. Introductory Remarks 

Every one who has had practical experience in variety 
or fertilizer tests or in any other experiments involving 
the comparison of field plots must have been impressed 
by the great difficulty of securing tracts with uniform soil 
for their cultures. 

A careful examination of the agricultural literature 
bearing on the question of variety tests will reveal many 
cases in which the experimenters have noted the difficulty 
of securing a uniform substratum, or in which there is 
internal evidence for the influence of substratum hetero- 
geneity upon the result. 

For example, in 1894-1895 tests of varieties of wheat 
were made on 77 plots at the University of Illinois. 1 As 
a check on the other strains, the variety known as Valley 
was sown on nine different plots "well distributed over 
the area sown. ' ' 

. . . the yields of this variety varied from 11.7 bushels to 24.1 
bushels, an average of 19 bushels which is remarkably close to the 
average of all the varieties. It is again remarkable that but eight yields 
were above the highest of the Valley, and but three below the lowest 
of the same variety, . . . 

The only reasonable explanations that can be given for 
such results are either (a) that the plots were so small 
that the results are due purely and simply to the errors of 
random sampling, or (6) that the wide divergences in the 

i Bull. Univ. III. Agr. Exp. Sta., 41, 1896. 
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results for the individual variety are due to substratum 
heterogeneity. 

In either ease, the results secured are obviously worth- 
less as indicating differences in the value of the individual 
varieties. 

Seventeen years ago, Larsen 2 reached the conclusion 
that the results of experimental tests were much more 
exact when a given area is divided into a large number of 
small plots upon which the tests are made than when it is 
divided into a few larger plots. 

Hall 3 has laid great emphasis upon irregularities of ex- 
perimental fields. Mercer and Hall in their interesting 
paper on "The Experimental Error of Field Trials" 4 
discuss at considerable length various phases of the influ- 
ence of soil heterogeneity upon field results. In an ap- 
pendix to their paper, Student 5 takes up the problem of 
the method of arranging plots so as to utilize to the best 
advantage a given area of land in testing two varieties. 6 

The influence of substratum heterogeneity is also read- 
ily seen in Montgomery's interesting experimental data 
for wheat. 7 

Indeed, it is quite possible that without special precau- 
tions irregularities in the substratum may have greater 
influence upon the numerical results of an experiment 
than the factors which the investigator is seeking to com- 
pare. Elsewhere 8 I have shown that the differentiation 

2 Larsen, B. B.., "Andra nordska Landbrakskongressen i Stockholm," 
1897, I, p. 72; fide G. Holtermark and B. E. Larsen, Lanwirtscliaftl. Ver- 
such-Stationen, 65, 1, 1907. 

s Hall, A. D., "The Experimental Error of Field Trials," Journ. Board 
Agr. Great Britain, 16, 365-370, 1909. 

* Journ. Agr. Set., 4, 107-127, 1911. 

s Student, Journ. Agr. Sci., 4, 128-132, 1911. 

6 For several years, I have in careful tests labelled each seed individually 
and scattered them at random over the field to eliminate the influence of soil 
heterogeneity 

t Montgomery, E. G., "Variation in Yield and Method of Arranging 
Plots to Secure Comparative Results, " Ann. Hep. Neb. Agr. Sta., 25, 164-180, 
1912. 

s Harris, J. Arthur, ' ' An Illustration of the Influence of Substratum 
Heterogeneity upon Experimental Results," Science, N. S., 38, 345-346. 
1913. 



432 THE AMEBIC AN NATURALIST [Vol. XLIX 

in an apparently uniform garden plot may be sufficient to 
mask entirely the influence of the weight of the seed 
(PJiaseolus vulgaris) planted upon the size of the plant 
(as measured by the number of pods) produced. It is 
very probable that certain pure-line experiments and con- 
clusions are entirely invalidated by the fact that the in- 
fluence of irregularities in the substratum were not suffi- 
ciently guarded against. 9 

Several authors have tried to obtain some measure of, 
or some corrective term for, substratum heterogeneity. 
For example, Mercer and Hall (loc. cit.) have plotted the 
yields across the field in both directions. Such methods, 
however, give but a very imperfect idea of irregularities 
in the soil. Heterogeneity is perhaps more likely to occur 
as a spotting of the field than as a relatively uniform 
change from one side to the other. This is clearly indi- 
cated in the diagrams published by Montgomery. The 
mere plotting of yields in any line across the field can not 
adequately take into account such irregularities. Fur- 
thermore, some quantitative measure (and some probable 
error of this measure) of the amount of irregularity, not 
. merely a demonstration of its existence, is required. 

Some generally applicable measure of the degree of 
homogeneity of the soil of a field (as shown by actual ca- 
pacity for crop production) seems highly desirable. Such 
a criterion should be universally applicable, comparable 
from species to species, character to character or experi- 
ment to experiment, and easy to calculate. 

I believe we may proceed as follows. Suppose a field 
divided into N small plots all planted to the same variety 
of plants. Let p be the yield of an individual plot. The 
variability of p may be due purely and simply to chance, 
since the individuals of any variety are variable and the 
size of the plots is small, or it may be due in part to differ- 
entiation in the substratum. If the irregularities in the 
experimental field are so large as to influence the yield of 

"See "The Distribution of Pure Line Means," Amee. Nat., 45, 686- 
700, 1911. 



No. 583] ON SUBSTRATUM HOMOGENEITY 433 

areas larger than single plots 10 they will tend to bring 
about a similarity of adjoining plots, some groups tend- 
ing to yield higher than the average, others lower. 

Now let the yields of these units be grouped into m 
larg-er plots, C P) each of n contiguous ultimate units, p. 
The correlation between the p'a of the same combination 
plot, C P , will furnish a measure (on the scale of to 1) 
of the differentiation of the substratum as expressed in 
capacity for crop production. If this correlation be sen- 
sibly 0, the irregularities of the field are not so great as 
to influence in the same direction the yields of neighboring- 
small plots. As substratum heterogeneity becomes greater, 
the correlation will also increase. The size of the co- 
efficient obtained will depend somewhat upon the nature 
of the characters measured, somewhat upon the species 
grown, and somewhat upon the size of the ultimate and 
combination plots. A knowledge of the values of the cor- 
relation to be expected must be determined empirically. 

Fortunately, very simple formulae are now available for 
calculating such coefficients. 11 

Let S indicate a summation for all the ultimate or com- 
bination plots of the field under consideration, as may be 
indicated by the capital G f or lower case p. Then in our 
present notation which is as much simplified as possible 
for the special purposes of this discussion 

= mC* 2 ) - S(p*)]lm[n(n - 1)]} -p 2 

where P is the average yield of the ultimate plots and <r P 
their variability, and n is constant throughout the m com- 
bination plots. 12 

10 Irregularities of soil influencing the plants of only a single small plot 
may in most work be left out of account, since they are of the kind to 
which differences between individual plants are to a considerable extent 
due, and are common to all the plots of a field. 

n Harris, J. Arthur, ' ' On the Calculation of Intra-class and Inter-class 
Coefficients of Correlation from Class Moments when the Number of Pos- 
sible Combinations is Large," Bioinetrika, 9, 446-^72, 1913. 

12 por the benefit of those who are frightened by f ormulse, it may be 
paraphrased as follows: One merely adds together the yields of a chosen 
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Ultimately, I hope to present experimental data of my 
own bearing on this problem. For the present, the method 
is admirably illustrated by a number of published records. 

II. Illtjstbations of Method 
A. Cases in which the Combination Plots are Equal in 

Size 
Illustration 1. Influence of substratum heterogeneity 
on yield of experimental plots of mangolds. 

TABLE I 

Yield of Combination Plots fob Mangolds, Obtained by Combining the 

Entries of Map A by Fours as Indicated by the Heavier Lines 



1,209 
172 


1,175 
183 

1,321 
191 

1,333 

188 

1,272 
177 


1,215 
171 


1,239 
175 


1,276 
205 


1,250 
185 


1,274 
187 


1,293 
184 

1,290 
185 


1,310 
207 


1,204 
159 


1,268 
172 

1,222 
167 


1,268 
200 


1,300 
172 


1,272 
173 


1,388 
215 


1,385 
193 


1,375 
194 


1,314 
193 


1,260 
180 


1,373 
219 


1,380 
204 


1,387 
202 


1,309 
177 


1,314 

188 

1,332 
194 


1,380 
229 


1,320 

180 


• 1,295 

188 


1,304 
187 


1,397 
226 


1,331 
183 


1,264 
183 


1,310 
188 


1,325 
183 


1,337 
203 


1,404 
194 


1,325 
190 


1,334 
190 


1,335 
192 


1,312 

211 


1,418 
193 


1,373 
196 


1,339 
189 


1,403 
198 


1,401 
226 



number of contiguous p plots to form a number m of C P plots. The sum 
of the squares of p is subtracted from the sum of the squares of C v and 
the result divided by ■>»[»(» — 1)] where n is the number of ultimate 
plots in each of the in combination plots. The quotient is reduced by sub- 
tracting the square of the mean yields of the ultimate plots, p, and the 
remainder divided by the square of the standard deviation of yields of 
ultimate plots, <Tp-. The quotient is the correlation between the yields of 
the ultimate units, p, of the same combination plot, C v , — the measure of 
heterogeneity required. Thus the calculation of the criterion is very simple 
indeed. 
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Map A. 



Pounds per Plot of Roots and Leaves of Mangolds, 
and Hall. 



Data of Mercer 
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Map A represents the Rothamsted field of mangolds 
grown by Mercer and Hall (loc. cit.). The upper entries 
are for pounds of roots, the lower for pounds of leaves. 

I now reduce the 200 areas to 50 by combining the ad- 
joining plots by fours, as indicated by the heavier lines on 
the map. Thus for leaves the Southwest combination 
plot, C P , is 67 + 52 + 56 + 51 = 226. Table I gives the 
result. 

This gives for roots : 

S(p) = 65715, S(p?) = 21674871, N = 200, 

p = 328.575, cr/ = 412.824, 13 

S(C P 2 ) = 86537439, m[n(n - 1)] = 50 X 4 X 3 = 600, 

[S(C P 2 )—S(p 2 )]/m[n(n~ 1)] =108104.280, 

and 

_ 108104.280 - 03 28.575)3 _ u 

'piPa — 412 824 — -u*^- 

The results for yield of leaves are 

S(p) =9541, S(p 2 ) = 45941, N = 200, 

^=47.705, a/ = 23.938, 

S(Cp 2 ) = 1832095, m[n(n - 1)] =50 X 4 X 3 = 600, 

[S(Cp 2 ) -S(p*)]/m[n(n-l)] =2286.923, 

whence 

2286.92 3 - (47.705) 2 
r PlP , = 2 3 T9 -3 8 = -466 ± .037. 

Illustration 2. Influence of Substratum Heterogeneity 
upon the Yield of Straiv and Grain in Experimental Plots 
of Wheat. 

13 The standard deviation is most conveniently calculated in cases like the 
present, in which one requires the summed squares of actual values for other 
purposes from 

i* The probable errors have in all cases been calculated upon the actual, 
not the weighted, number of ultimate plots as N. 
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The wheat field of Mercer and Hall is divided into 
25x20 = 500 plots, Map B. Combining the plots by- 
fives from east to west and by fours from north to south, 
I have condensed this into 5X5 = 25 Cp plots, each of 
20 ultimate plots as shown in Table II. 



Yields of Combination 
Original Areas 


TABLE II 
Plots op Eothamsted Wheat, 
Separated by Double Lines in 


4X5 

'. Map 


Grouping. 
B 


82.89 
139.36 


83.05 
132.41 


7S.63 
122.84 




78.76 
120.53 


i 


74.70 
114.58 


78.15 
130.60 


84.34 
140.31 

84.70 
j 149.58 


75.61 
120.11 


80.32 
119.27 


74.87 
112.21 


79.80 
133.31 


74.94 

125.27 

73.60 
131.80 

82.25 
138.42 


81.50 
133.28 

71.35 
121.18 

68.52 
120.09 


77.34 
120.09 


84.36 
142.79 


, 82.42 
147.74 


75.81 
122.02 


85.19 
147.95 


84.56 
146.78 


76.69 

124.8S 



Summing the actual yields and the squares of yields 
for the ultimate plots and the squares for the combination 
plots, I find the following values : 

For wheat grain 

8{p) = 1974.32, S(p 2 ) =7900.6790, # = 500, 

p = 3.949, V = .209600, 

SiC*) = 156419.3106, m [n(n - 1) ]= 25 X 20 X 19 

= 9,500, 

[8(Cp 2 ) -S(p 2 )]/m[n(n-l)] =15.633540, 

which leads to 

15.633540 - (3.949) 2 1Qra _ n9Q 

. »W= = - ".209600" " = 8 ± '° 9 ' 

For wheat straw 

8(p) = 3257.40, S(p 2 ) = 21623.9802, N = 500, 

p = 6.515, V = .805341, 

S(C P * = 427479.9920, m[n(n~ 1)] =9500, 

[8(Cp 2 ) — S(p*)]/m[n(n-l)] =42.721685, 
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■ . _ 42.72168 5 - (6.515)-- _ 
Wa ~ .805341 --343 ±.02;. 

Illustration 3. Influence of Substratum Heterogeneity 
upon Yield of Grain and Nitrogen Content in Experi- 
mental Plots of Wheat. 

Table III is condensed from Map C of Montgomery 15 
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/\ /\ /\ 

Map C. Yields of Grata in Grams and Percentage of Nitrogen in Montgomery's 

Wheat Plots. The tipper Entries are Yield in Grams of Grain, the Lower 

are Percentage Nitrogen Content. 

15 Montgomery, E. G., "Variation in Yields and Method of Arranging 
Plots to Secure Comparative Results," Ann. Hep. Neb. Agr. Exp. Sta., 25, 
164-177, 1912. 
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by combining adjoining plots 2x2. The following are 
the numerical values. 
For grains produced, 

S(p) = 123429, S(p 2 ) = 70112319, N = 224, 

p = 551.022, <7 P - = 9375.826, 

S(C P 2 ) = 277945243, m[n(n - 1)] =642, 

[S(C v *)—S(p 2 )]/m[n(n - 1)] = 309275.184, 

whence 

r PlP2 = .603 + .029. 

For percentage nitrogen, 

8(p) = 465.29, 8(p s ) = 968.3721, N = 224, 
p = 2.077187, cr.2 = .008383, 

S(G P 2 ) = 3868.5047, m[»(» - 1)] = 672, 
[^(C 2 ) -/5(^ 2 )]/m[«(»-l)] =4.315673, 



and 



r PlJ ,, = .115 + .044. 



TABLE III 

Combination Plots op Montgomery's Wheat, 2 x 2-Fold Grouping as 
Indicated by Heavy Lines in Map C 
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Illustration 4. Influence of Substratum Heterogeneity 
upon the Yield of Experimental Plots of Timothy Hay. 

I take as a final illustration of the application of the 
criterion of substratum heterogeneity here proposed, the_ 
plot data for timothy hay published by Holtermarck and 
Larsen, loc. cit. By combining their plots into groups of 
4 Table IV is secured, 

S(p) = 4268.8, S(p 2 ) =77968.50, iV = 240, 

p = 17.787, op 2 = 8.503, 

S(Cp*) =309491.48, m[n(n — 1)] =720, 

whence 

r Pll , 2 = .609 + .027. 



TABLE IV 

Combination Plots 2 X 2, Showing Yields of Timothy Hat Secured in 

the Experiment of Larson 

The original field is not mapped here 
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B. Cases in which the Combination Plots Vary in Size 

In the foregoing illustration the combination plots have 
been of uniform size, i. e., have contained each the same 
number of ultimate plots. It may be desirable or neces- 
sary to have some of the combination plots smaller than 
the others. Thus the wheat field of Mercer and Hall is 
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laid out in a 20 X 25 manner. This permits only 2x5, 
4 X 5 or 5 X 5 combinations of the same size throughout. 
Montgomery's experiment comprises an area of 16x14 
plots which may be combined in only 2 X 2 or 4 X 2 equal 
areas suitable for calculation. In each of these cases 
other groupings are desirable. 

The formulae are quite applicable to such cases : the 
arithmetical routine is merely a little longer. The for- 
mula is again 



{{SiC,?) - S(f~)]/S[n(n - 1)]} - Jy 



but p and a P are obtained by a (n — l)-fold weighting of 
the plots, 16 where n is the number of ultimate plots in the 
combination plot to which any p may be assigned, i. e., 

p = S[(n-l)p]/S[n(n-l)], 

2 _ S[(n - l) p a ] _ ( S[(n-l )p]\* 
a " ~ S[n(n-1)] \S[n(n-i)])' 

The point may be illustrated in detail on the wheat data 
of Mercer and Hall. I adopt a combination by twos from 
north to south, i. e., arrange the data in 10 rows of com- 
bination plots instead of 20 rows of ultimate plots. From 
east to west there are 25 rows of ultimate plots ; these can 
be only reduced to a 2 X 2-fold grouping for the first 22 
rows. The lines of division are indicated on Map B by 
marginal arrows. 

Row 23-25 must be thrown into combination plots each 
of 6 units. The possible permutations within a combina- 
tion plot are 1/2 n(n — 1), but since the surfaces are ren- 
dered symmetrical, the total permutations for the whole 
field is 8[n(n — 1)]. There are only two sizes of combi- 
nation plots, of which 110 have 4 and 10 have 6 ultimate 
plots each. Thus the weighted population N is 

if That is, each ultimate plot is multiplied by the number less one of the 
plots in the combination plot to which it is assigned. 



444 THE AMEBIC AN NATURALIST [Vol. XLIX 

S[»(«-l)] = (110X4X3) + (10X6X5) =1620. 
In the calculation of the weighted means and standard 
deviations each entry, and the square of each entry, in the 
first 22 rows must be weighted in an (n — 1) -fold = 3-fold 
manner, while those for the last three rows must be 
weighted in a 5-fold manner. 17 

The numerical values are : 

For grain, 

8[(n-l)p] =6378.72, S[(n — l)p 2 ] =25452.4154, 
p = 3.937, <v> = . 207610, 

8(C P 2 ) = 33129.7080, 8(p 2 ) = 7900.6790, 
whence 

r„ lP2 = .354 + .026. 

Note that S(p 2 ) is constant for all groupings. 
For straw, 

8[ (n - l)p] = 10474.52, 8[ (» - l)p 2 ] = 69042.7194, 

p = 6.466, <r P 2 = . 813000, 

8(C p a ) = 89985.8976, 8(p 2 ) = 21623.9802, 

whence 

r W2 = .479 + .023. 

Weighting has not materially changed the physical con- 
stants from the values given under illustration 2 above. 
The reasons for the conspicuous differences in the corre- 
lations will be taken up presently. 

Montgomery's wheat data have been grouped into 2 X 2- 
fold combination plots in the illustration above. If we 
again combine the entries of Table III by twos, beginning 
at the upper left-hand corner, we have 12 combination 
plots each 4x4, or of 16 ultimate plots, and 4 combina- 

1' Since each individual ultimate plot is compared once as a first (or as 
a second) number of a pair with every plot classed with it, the weighting 
of the individual plots for means and standard deviations is an (n — 1)- 
fold one. 
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tion plots each of 2 X 4 = 8 ultimate plots. The method 
of dividing up the field' is indicated hy the marginal ar- 
rows on Map C. 

8[n(n - 1)] = (12 X 16 X 15) + (4x8x7)= 3104. 
For grain, 

8[(n-l)p] =1707635, 8[(n- l)p?] =9683408.57 
p = 550.140, V = 9311.307, 
S(G P 2 ) =1023184887, S(p 2 ) = 70112319, 
whence 

r nP2 = .472 + .035. 
For nitrogen, 

8 [in - l)p]= 6458,63, S[(n - l)p 2 ] = 13464.6031, 
p = 2.080744, <7 P 2 = .008327, 
S(C V 2 ) =14409.6095, 8(p 2 ) =968.3721, 
and 

r PlP2 = .096 + .045. 

Again the weighted means and standard deviations do 
not differ widely from those used above. The differ- 
ences in the correlations will be discussed below. 

In concluding this section it may be pointed out that 
all of the foregoing values are surprisingly high. They 
indicate clearly that the irregularities of an apparently 
uniform field may influence profoundly the yield of a 
series of experimental plots. They also bring out an- 
other interesting point. In the three cases in which two 
different characters were measured on the same species 
they show very different susceptibilities to environmental 
influence. Thus, for example, the correlation of man- 
gold roots is r = .346 + .042 as compared with r = .466 + 
.037 for leaves. For grain on the Rothamsted field with a 
4 X 5-fold grouping the correlation is r = . 186 + .029 as 
compared with r = .343 + .027 for straw. For Montgom- 
ery's data for yield and composition the differences are 
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even more conspicuous. The correlation for per cent, ni- 
trogen is r = . 115 + .044 as compared with r=.603 + .029 
for weight of grain produced. 

This point will not be discussed in greater detail here, 
since the problem of the relative susceptibility of various 
characteristics of the individual to environmental influ- 
ence has been the subject of experimental and statistical 
studies which have been under way for several years and 
will probably eventually be published. 

III. On the Nature op the Regression oe Associated 

Plots 

The correlation coefficient is strictly valid as a measure 
of interdependence only when regression is linear, i. e., 
when the means of the second variable associated with 
successive grades of the first lie in a sensibly straight line. 
The equation for the regression straight line 
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for the second on the first ultimate plot of the same com- 
bination plot reduces to 

p=(p — rp) -f rp, 

when symmetrical tables in which p 1 = p 2 , a- Pl = <r P2 are 
used. 

The testing of the linearity of regression in any indi- 
vidual case is rendered somewhat difficult by the necessity 



TABLE V 
Yield of Grain in Eothamsted Wheat Experiment 



Yield of First 
Plot 


Weighted 
Frequency 


Mean Yield 

of Associated 

Plots 


Yield of First 
Plot 


Weighted 
Frequency 


Mean Yield 

of Associated 

Plots 


2.75-2.99 


133 


3.76 


4.00-4.24 


1786 


3.99 


3.00-3.24 


475 


3.78 


4.25-4.49 


1444 


4.07 


3.25-3.49 


1026 


3.81 


4.50-4.74 


703 


4.04 


3.50-3.74 


1634 


3.89 


4.75-4.99 


247 


4.05 


3.75-3.99 


1919 


3.93 


5.00-5.24 


133 


4.16 
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of actually forming a correlation table from which to com- 
pute the means of arrays: The labor is greatly lessened 
by the use of some such scheme as that described for the 
formation of condensed correlation tables. 18 

TABLE VI 
Yield of Straw in Bothamsted Wheat Experiment 



Yield of First 


Weighted 


Mean Yield 


Yield of First 


Weighted 


Mean Yield 


Plot 


Frequency 


Plots 


Plot 


|Frequency 


Plots 


4.00-4.24 


19 


6.11 


6.50-6.74 


608 


6.56 


4.25-4.49 


19 


5.68 


6.75-6.99 


817 


6.69 


4.50-4.74 


133 


6.08 


7.00-7.24 


779 


6.86 


4.75-4.99 


171 


6.07 


7.25-7.49 


665 


6.84 


5.00-5.24 


304 


6.19 


7.50-7.74 


627 


7.04 


5.25-5.49 


418 


6.13 


7.75-7.99 


323 


6.96 


5.50-5.74 


722 


6.18 


8.00-8.24 


247 


7.14 


5.75-5.99 


1121 


6.20 


8.25-8.49 


57 


7.09 


6.00-6.24 


1273 


6.31 


8.50-8.74 


152 


6.75 


6.25-6.49 


969 


6.38 


8.75-8.99 


76 


7.28 
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Figs. 1 and 2. Mean Yields of Grain and Straw on Ultimate Plots Asso- 
ciated in the Same Combination Plots of a Given Yield. Rothamsted Wheat. 
Empirical Means and Fitted Straight Line. Units are Quarters of a Pound. 

is Harris, J. Arthur, ' ' On the Formation of Condensed Correlation Tables 
when the Number of Combinations is Large,'' Amer. Nat., 46, 477-486, 
1912. 
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For the 5X4 grouping of the 500 wheat plot of Mercer 
and Hall I find the values given in Tables V-VI. 

For the regression of the second on the first plot the 
equations are : 



For grain, g, 
For straw, s, 



# 2 =3.214 + . 186 T 
s 2 =4.280 + . 343 s T , 



Figs. 1 and 2 exhibit the usual irregularities of sam- 
pling in the means, but show no certain departure from 
linearity. 

TABLE VII 

Yield of Grain in Montgomery's Wheat Experiment 



Yield of First 


Weighted 


Mean Yield 


Yield of First 


Weighted 


Mean Yield 
of Associated 


Plot 


Frequency 


Plots 


Plot 


Frequency 


Plots 


325-374 


9 


516.88 


575-624 


111 


579.82 


375-424 1 


63 


440.22 


625-674 


90 


616.21 


425-474 


93 


471.23 


675-724 


45 


656.37 


475-524 


108 


540.24 


725-774 


30 


628.80 


525-574 


120 


548.24 


775-824 


3 


574.00 
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F/4l/i-& 3 
Fig. 3. Grain Yields in Nebraska Wheat. See Figs. 1-2 for Explanation. 
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Table VII gives the first plot character, weighted fre- 
quencies and empirical means for associated plots for 
2 X 2-fold combinations from Montgomery's grain yield 
data in wheat. 19 

The equation is 

For grain, g, g x = 218.993 + .603 g x . 
The graph figures indicate sensible linearity. 

IV. Influence oe Number oe Ultimate Plots Combined 

If an experimental field exhibit irregularities of condi- 
tions which influence in a measurable degree the yield of 



TABLE VIII 

5 X 2-Fold Combination op Plots of Rothamsted Wheat 
The divisions of the field are indicated by the double vertical lines and 
the arrows along the right margin in map B. 



41.11 
68.19 


42.51 
66.59 

40.54 

65.82 


40.32 
62.22 

38.31 
60.62 

37.77 
60.65 


38.53 
59.52 


36.65 
54.45 


41.78 
71.17 


40.23 
61.01 


38.05 
60.13 


40.3.5 
69.10 


41.92 
70.37 


40.01 
58.10 

40.31 
61.17 


39.48 
5S.00 


37.80 
61.50 


42.42 
69.94 

42.03 
71.09 


37.84 
59.46 


35.39 
54.21 


40.42 
65.95 


36.69 
59.97 


41.84 
64.09 


38.83 
56.99 


39.38 
67.36 


42.67 

78.49 


38.25 
65.30 


39.66 
69.19 


38.51 
63.10 


42.77 
71.9.5 


42.17 
75.20 

40.25 
72.24 


38.07 
66.92 


38.05 
64.05 


40.22 
63.45 


41.59 
70.S4 


35.53 

64.88 

40.12 
67.99 


33.30 
57.13 

34.00 
60.55 

34.52 
59.54 


35.59 

58.57 


41.75 
71.84 


41.44 
71.92 


38.13 
62.36 


43.44 
76.11 


43.12 
74.86 


42.13 
70.43 


38.53 
62.52 



19 Because of the many differences in the two experiments it is inadvisable 
to attempt drawing the regressions lines in a strictly comparable form. 
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neighboring small experimental plots, this heterogeneity 
should become apparently less when expressed on a scale 
of correlation between plots as the number of ultimate 
plots combined increases. The reason for this condition 
is quite simple. If the irregularities are very local in 
nature they will influence in the same direction the yield 
of only a very few neighboring plots. If too many ulti- 
mate plots be combined the correlation will tend to vanish 
because of the increased frequency of association of un- 
like conditions due to the fact that the combination plots 
have been made so large that they themselves have become 
heterogeneous. 

That these conditions have been observed in actual ex- 
perimentation is shown by the following constants based 
on different groupings of the data used above. 

Consider first the Eothamsted wheat. For a 4 X 5 
grouping of the plots the results were found to be 

For grain, r Pl , 2 = .186 + .029, 

For straw, r l>m = .343 + .027. 

If the plots be grouped by fives from east to west and 
by twos from north to south, Table VIII is obtained. The 
values 8(p 2 ), p and o> are the same as in the preceding 
case. 

m[n(n — 1)]=50X10X9 = 4500. 

For grain, S(C P *) = 78265.2822, r ni>2 = .214 + .029. 

For straw, S(C„ 2 ) = 213939.8774, r nH = .365 + .026. 

If the combination plots be made even smaller by group- 
ing in a 2 X 2-fold manner for all but the last three north 
and south rows, where a 2 X 3-fold combination must be 
adopted, the results are, as illustrated above, 

For grain, r nn = .354 + .026, 

For straw, r Pl „ 2 = .479 + -023. 

For Montgomery's wheat data the results for a 4 X 4- 
fold grouping (in as far as the nature of the records will 
permit) have been shown to be 
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For grain, r nP2 = .472 + .035, 

For nitrogen, r n „ 2 = .096 + .045, 

as compared with the following values for a 2 X 2-fold 
grouping 

For grain, r Pm = .603 + .029, 

For nitrogen, r Pl „ 2 = .115 + .044. 

Finally consider the constants deduced from the hay 
yields published by Holtermark and Larsen. 

For a 2 X 2-fold grouping, r PlP2 = .609 + .027, 
For a 2 X 4-fold grouping, r Pin = All + .034, 
For a 2 X 8-fold grouping, r ni>2 = .278 + .040. 

Thus for every species of plant and every character con- 
sidered the correlation between associated ultimate plots 
decreases as the number of plots grouped increases. 20 



.TABLES IX AND X 

2 X 4-Fold and 2 X 8-Fold Combination of the Data for Plot Yield 
in Timothy Hay, Tables Derived prom Table IV 



163.8 


169.2 


153.5 


138.9 


134.9 


123.0 


142.0 


119.3 


130.6 


188.6 


155.4 


128.6 


130.9 


120.3 


123.3 


164.2 


155.0 


135.9 


129.8 


139.1 


121.5 
128.6 


125.0 


159.1 


150.8 


146.3 


150.0 


164.6 


146.7 


129.9 



305.8 


288.5 


284.1 


352.8 


310.4 


264.5 


260.7 


259.4 


244.8 


289.6 


305.8 


280.7 


285.2 


284.9 


251.6 









so Of course, the same effect would he produced if comparisons were 
drawn between tests for substratum heterogeneity on fields comparable 
in every regard except for the size of the ultimate plots. Possibly, this ex- 
plains in part, at least, the striking differences in the correlations for grain 
yield found from the records of Montgomery and of Mercer and Hall. 

The Eothamsted plots were l/500th acre in area or 87.12 square feet. 
Montgomery's plots were 5.5 X 5.5 = 30.25 square feet, or only about 1/3 
of the area of the Eothamsted plots. 
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V. Recapitulation and Discussion 

If the methodical production of new varieties of animals 
and plants to be made possible by the laws discovered 
in experimental breeding is to be of material practical 
value, more attention must be given to the development 
of a standardized scientific system of variety testing. 
From the practical standpoint, nothing is to be gained by 
the formation of varieties of plants differing in discern- 
ible features of any kind unless some of these varieties 
can by rigorous scientific tests be shown to be of superior 
economic value. 

It is equally true that if tests of fertilizers or of dif- 
ferent methods of irrigation carried out on an experi- 
mental scale are to have any real value as a guide to a com- 
mercial practise, the differences in the experimental re- 
sults must certainly be significant in comparison with 
their probable errors. 

The problem of plot tests has several different phases, 
all of which must ultimately receive careful investigation. 
The purpose of this paper has been to consider one of the 
problems only. To what extent do the irregularities of 
an apparently homogeneous field selected for comparative 
plot tests influence the yield of the plots 1 

The question has been far too generally neglected, 
although indispensable to trustworthy results. It is ob- 
viously idle to conclude from a given experiment that va- 
riety A yields higher than variety B, or that fertilizer X 
is more effective than fertilizer Y, unless the differences 
found are greater than those which might be expected 
from differences in the productive capacity of the plots 
of soils upon which they were grown. 

The first problem has been to secure some suitable 
mathematical criterion of substratum homogeneity (or 
heterogeneity). Such a criterion should be expressed on 
a relative scale ranging from to 1, in order that com- 

Tlie 2 X 2-fold grouping of Montgomery's plots gives a correlation of 
.603 ± .029 as compared with r= .354 ± .026 for as nearly a perfect 2X2- 
fold grouping as the Kothamsted records permit. 
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parisons from field to field, variety to variety or character 
to character, may be directly made. It should also, if pos- 
sible, offer no difficulties of calculation. 

The criterion proposed is the coefficient of correlation 
between neighboring plots of the field. An exceedingly 
simple formula for the calculation of such coefficients has 
been deduced. 

The method of application of this coefficient is here il- 
lustrated by four distinct series of experimental data. 

The remarkable thing about the results of these tests 
is that in every case the coefficient of correlation has the 
positive sign and that in some instances it is of even more 
than a medium value. In short, in every one of these ex- 
perimental series the irregularities of the sub stratum have 
been sufficient to influence, and often profoundly, the ex- 
perimental results. 

It might be objected that by chance, or otherwise, the 
illustrations are not typical of what ordinarily occurs in 
plot cultures. But they have been purposely drawn from 
the writings of those who are recognized authorities in 
agricultural experimentation, and who have given their 
assurance of the suitability of the fields upon which the 
tests were made. 

For example, Mercer and Hall state the purpose of 
their research to be, "to estimate the variations in the 
yield of various sized plots of ordinary field crops which 
had been subjected to no special treatment and appealed 
to the eye sensibly uniform." Their mangolds "looked 
a uniform and fairly heavy crop for the season and soil, ' ' 
while in their wheat field "a very uniform area was se- 
lected, one acre of which was harvested in separate plots, 
each one five hundredth of an acre in area." The data 
of Larsen were drawn from an experiment "auf einer 
clem Auge sehr gleiehmiissig erscheinenden, 3 Jahre alten 
Timotheegraswiese. " Montgomery's data were secured 
from a plot of land only 77 X 88 feet in size, which had 
been sown continuously to Turkey Eed wheat for three 
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years, "and was of about average uniformity and fer- 
tility." 

Nothing could, it seems to me, emphasize more emphat- 
ically the need of a scientific criterion for substratum homo- 
geneity than the facts that correlations between the yields 
of adjacent plots ranging from r = .115 to r = .609 can be 
deduced from the data of fields which have passed the 
trained eyes of agricultural experimenters as satisfac- 
torily uniform. 

December 12, 1914 



